1Multidimensional Dependencies in Classi cation and Ordination

نویسنده

  • Carles M. Cuadras
چکیده

1 ABSTRACT The relations between two distance matrices on the same nite set are analyzed, via metric scaling, by correlating principal axis. Some applications are given and illustrated with examples. 1.1 Introduction Dissimilarities, similarities and distances are fundamental concepts in mul-tidimensional scaling and related topics. Euclidean and Mahalanobis distance also play a basic role in techniques such as regression and discrimination. In many cases, distances are computed by observing variables on individuals, but in general, the dependence between variables is not taken into account. Mahalanobis, and its extension Rao's distance, is an important exception. This distance between two observations x;y, say, (x ? y) 0 ?1 (x ? y) depends on the covariance matrix , hence its computation is not possible when the variables are categorical, binary or mixed. In such situations, distances are obtained from similarities such as Jaccard (binary), matching (categorical) and Gower (mixed) coeecients. Other coeecients are possible, but none of them has the property of including the relationships among variables. Motivated by this problem, CF97b] recently introduced related metric scaling, a new multidimensional scaling method to represent objects when two distances are deened on them. This method is based on the construction of a joint distance that has some compatible properties, especially identifying and discarding redundant information.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multidimensional Dependencies in Classi cation and Ordination

The relations between two distance matrices on the same nite set are analyzed via metric scaling by correlating principal axis Some applications are given and illustrated with examples Introduction Dissimilarities similarities and distances are fundamental concepts in mul tidimensional scaling and related topics Euclidean and Mahalanobis dis tance also play a basic role in techniques such as re...

متن کامل

On the Difference between Optimal Combination Functions for Verification and Identification Systems

We have investigated di®erent scenarios of combining pattern matchers. The combination problem can be viewed as a construction of a postprocessing classi ̄er operating on the matching scores of the combined matchers. The optimal combination algorithm for veri ̄cation systems corresponds to the likelihood ratio combination function. It can be implemented by the direct reconstruction of this functi...

متن کامل

A Comparison of Event Models for Naive Bayes Text Classi cation

Recent approaches to text classi cation have used two di erent rst order probabilistic models for classi ca tion both of which make the naive Bayes assumption Some use a multi variate Bernoulli model that is a Bayesian Network with no dependencies between words and binary word features e g Larkey and Croft Koller and Sahami Others use a multinomial model that is a uni gram language model with i...

متن کامل

Inductive and Bayesian learning in medical diagnosis

Although successful in medical diagnostic problems inductive learning systems were not widely accepted in medical practice In this paper two di erent approaches to machine learning in medical appli cations are compared the system for inductive learning of decision trees Assistant and the naive Bayesian classi er Both methodologies were tested in four medical diagnostic problems localization of ...

متن کامل

Bayesian Inference in Mixtures-of-Experts and Hierarchical Mixtures-of-Experts Models With an Application to Speech Recognition

Machine classi cation of acoustic waveforms as speech events is often di cult due to context-dependencies. A vowel recognition task with multiple speakers is studied in this paper via the use of a class of modular and hierarchical systems referred to as mixtures-of-experts and hierarchical mixtures-of-experts models. The statistical model underlying the systems is a mixture model in which both ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007